Electronic Journal of Statistics 

Vol. 2 (2008) 332-367 

ISSN: 1935-7524 

DOI: 10.1214/08-EJS197 

The needlets bispectrum* 

Xiaohong Lan 

Institute of Mathematics, Chinese Academy of Sciences and 
Department of Mathematics, University of Rome Tor Vergata 
e-mail: lan@mat.uniroma2.it

Domenico Marinucci 

Department of Mathematics, University of Rome Tor Vergata 
e-mail: marinucc@mat.uniroma2.it

Abstract: The purpose of this paper is to join two different threads of 
the recent literature on random fields on the sphere, namely the statisti- 
cal analysis of higher order angular power spectra on one hand, and the 
construction of second-generation wavelets on the sphere on the other. To 
this aim, we introduce the needlets bispectrum and we derive a number of 
convergence results. Here, the limit theory is developed in the high resolu- 
tion sense. The leading motivation of these results is the need for statistical 
procedures for searching non-Gaussianity in Cosmic Microwave Background 
radiation. 

AMS 2000 subject classifications: Primary 62G20; secondary 62M15, 
60B15, 60G60. 

Keywords and phrases: Bispectrum, Needlets, Spherical Random Fields, 
Cosmic Microwave Background Radiation, High Resolution Asymptotics. 

Received February 2008. 

Contents

1 Introduction

2 Some background material

2.1 A Review on Needlets

2.2 Diagram Formula

3 A Central Limit Theorem for the Needlets Bispectrum

3.1 The needlets bispectrum

3.2 Unknown angular power spectrum

4 Convergence to multiparameter Gaussian processes

5 Behaviour under non-Gaussianity

References


*We are grateful to a Referee and an Associate Editor for a very careful reading and 
many insightful remarks on an earlier version of this paper. We are also grateful to P.Baldi, 
G.Kerkyacharian and D.Picard for many useful discussions. The research of Xiaohong Lan 
was supported by the Equal Opportunity Committee of the University of Rome Tor Vergata. 






X. Lan and D. Marinucci/Needlets bispectrum 






1. Introduction 

The purpose of this paper is to join two different threads of the recent literature
on random fields on the sphere, namely the statistical analysis of higher order
angular power spectra on one hand, and the construction of second-generation
wavelets on the sphere on the other hand. More precisely, we shall be concerned
with zero-mean, mean square continuous and isotropic random fields on the
sphere, for which the following spectral representation holds [ ' , ']: for $x \in S^2$,

$$T(x) = \sum_{l=1}^{\infty} \sum_{m=-l}^{l} a_{lm} Y_{lm}(x), \qquad (1)$$

where $\{a_{lm}\}_{l,m}$, $m = -l, \dots, l$, is a triangular array of zero-mean, orthogonal,
complex-valued random variables with variance $E|a_{lm}|^2 = C_l$, the angular power
spectrum of the random field. For $m < 0$ we have $a_{lm} = (-1)^m \overline{a}_{l,-m}$, whereas
$a_{l0}$ is real with the same mean and variance. (1) holds in the $L^2(S^2)$ sense, i.e.
we have

$$\lim_{L \to \infty} E \int_{S^2} \Big\{ T(x) - \sum_{l=1}^{L} \sum_{m=-l}^{l} a_{lm} Y_{lm}(x) \Big\}^2 \mu(dx) = 0,$$

$\mu(dx)$ denoting the uniform measure on the sphere. The functions $\{Y_{lm}(x)\}$ are
the so-called spherical harmonics, i.e. the eigenvectors of the Laplacian operator
on the sphere,

$$\Big[ \frac{1}{\sin\vartheta} \frac{\partial}{\partial\vartheta} \Big( \sin\vartheta \frac{\partial}{\partial\vartheta} \Big) + \frac{1}{\sin^2\vartheta} \frac{\partial^2}{\partial\varphi^2} \Big] Y_{lm}(\vartheta, \varphi) = -l(l+1) Y_{lm}(\vartheta, \varphi),$$

where we have moved to spherical coordinates $x = (\vartheta, \varphi)$, $0 \le \vartheta \le \pi$ and
$0 \le \varphi < 2\pi$. It is a well-known result that the spherical harmonics provide a
complete orthonormal system for $L^2(S^2)$ [36].

The analysis of random fields on the sphere has recently gained very strong
physical motivations, due to the overwhelming amount of data which is becoming
available on Cosmic Microwave Background radiation (hereafter CMB).
As detailed elsewhere [15], to the first order we can view CMB data as maps
of the Universe in the immediate adjacency of the Big Bang. The first of these
maps were provided by the satellite experiment COBE in 1993, and in view
of this G. Smoot and J. Mather were awarded the Nobel Prize for Physics in
2006. Much more refined maps have been made available by another NASA
satellite experiment, WMAP (http://wmap.gsfc.nasa.gov/); still more refined
data are expected from the ESA mission Planck, scheduled to be launched
in October 2008. These huge collaborations involve hundreds of scientists and
are expected to provide invaluable information on Physics and Cosmology. At
the same time, these massive data sets have raised huge statistical challenges,
ranging from power spectrum estimation to outlier detection, testing
for isotropy, efficient denoising and map-making, handling missing data,
testing for non-Gaussianity and many others.

Among these issues, particular interest has been attracted by efficient testing
for non-Gaussianity. This is due to strong physical motivations (the leading
paradigm for the Big Bang dynamics predicts (very close to) Gaussian fluctuations)
and difficulties in finding a proper statistical procedure. There is by now a
wide consensus that the most efficient procedures to probe non-Gaussianity are
based upon the bispectrum, in the ideal circumstances where the spherical
random field is fully observed: see for instance [8, 11, 20, 23, 24, 37]. However, the
properties of the bispectrum are also known to deteriorate dramatically in the
presence of missing observations, see again [11]. To handle the latter problem,
a general approach is to focus on wavelet, rather than Fourier, transforms. The
construction of spherical wavelets has recently drawn an enormous amount of
attention in the literature, see for instance [2, 27, 38] and the references therein.
In our view, a particularly convenient tight frame construction on the sphere is
provided by so-called needlets, which were introduced in [30, 31]; their application
to spherical random fields is due to [6, 7]. Needlets enjoy two properties
which seem especially worth recalling: they are quasi-exponentially localized in
the real domain and compactly supported in the harmonic domain. A further,
quite unexpected property is as follows: random needlets coefficients are asymptotically
uncorrelated at the highest frequencies and hence, in the Gaussian case,
independent, see again [6]. This latter feature is rather surprising in a compact
domain and makes asymptotic theory possible even in the presence of a single
realization of a spherical random field.

Our aim in this paper is to borrow ideas from the bispectrum and the needlets
literature to propose and analyze a needlets bispectrum, where the random
coefficients in the needlets expansion are combined in a similar way to the
bispectrum construction. The aim is to obtain a procedure which mimics the ability
of the bispectrum to search for non-Gaussianity at the most efficient combination
of frequencies, at the same time providing a much more robust construction
in the presence of missing data, as is typical of the needlets. The plan of the
paper is as follows: in Section 2 we review some background material on spherical
random fields, the bispectrum and the needlets construction. In Section 3 we
introduce the needlets bispectrum and we establish a central limit theorem, in
the high resolution sense. In Section 4 we go on to establish a functional central
limit theorem for the integrated needlets bispectrum; in Section 5 we provide
some preliminary discussion on the behaviour under non-Gaussian assumptions
and discuss some possibilities for applications and further research.

2. Some background material 
2.1. A Review on Needlets 

In this subsection we shall briefly recall the main features of the needlets
construction. As mentioned above, needlets were first introduced in the Functional
Analysis literature by [30, 31], whereas the investigation of their properties from
a stochastic point of view is due to [5, 6] and [7]; see also [19, 17]. We need first
to introduce some notation and definitions, which are largely identical to those
in [6, 31].

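Given any two positive sequences $\{a_j\}$, $\{b_j\}$, we write $a_j \approx b_j$ if there exists
$c > 0$ such that $c^{-1} a_j \le b_j \le c a_j$ for all $j$. The standard (open and closed)
balls in $S^2$ are given as always by $B(a, \alpha) = \{x, d(a, x) \le \alpha\}$, $B^{\circ}(a, \alpha) =
\{x, d(a, x) < \alpha\}$. For a general $A \subset S^2$ we will denote by $|A|$ the spherical
measure of $A$. Now fix $\varepsilon > 0$ and $x_1, \dots, x_N$ in $S^2$ such that $\forall i \neq j$, $d(x_i, x_j) > \varepsilon$;
the set $\Xi_\varepsilon = \{x_1, \dots, x_N\}$ is called a maximal $\varepsilon$-net if it satisfies $\forall x \in
S^2$, $d(x, \Xi_\varepsilon) \le \varepsilon$, $\cup_{x_i \in \Xi_\varepsilon} B(x_i, \varepsilon) = S^2$ and $\forall i \neq j$, $B(x_i, \varepsilon/2) \cap B(x_j, \varepsilon/2) = \emptyset$.
It follows from Lemma 5 in [7] that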

1 < TV < 

For all $x_i \in \Xi_\varepsilon$, the associated family of Voronoi cells is defined by:
$V(x_i) = \{x \in S^2, \forall j \neq i, d(x, x_i) \le d(x, x_j)\}$.

We recall that $B(x_i, \varepsilon/2) \subset V(x_i) \subset B(x_i, \varepsilon)$. Also, if two Voronoi cells are
adjacent, i.e. $V(x_i) \cap V(x_j) \neq \emptyset$, then by necessity $d(x_i, x_j) \le 2\varepsilon$. It is proved in
[7] that there are at most $6\pi^2$ adjacent cells to any given cell.
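
As an illustration of the Voronoi construction, the following minimal Python sketch assigns a point of the sphere to the cell of its nearest centre. The six octahedron vertices used as centres are a toy choice made here for illustration only (they are not a net taken from the paper), and the function names are hypothetical.

```python
import math

def great_circle_distance(x, y):
    """Geodesic distance d(x, y) = arccos(<x, y>) between two unit vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    # Clamp to [-1, 1] to protect acos against rounding errors.
    return math.acos(max(-1.0, min(1.0, dot)))

def voronoi_cell_index(x, centres):
    """Index i of the centre closest to x, i.e. of a cell V(x_i) containing x."""
    return min(range(len(centres)),
               key=lambda i: great_circle_distance(x, centres[i]))

# Toy set of centres (assumption): the six vertices of the octahedron.
centres = [(1.0, 0.0, 0.0), (-1.0, 0.0, 0.0),
           (0.0, 1.0, 0.0), (0.0, -1.0, 0.0),
           (0.0, 0.0, 1.0), (0.0, 0.0, -1.0)]

# A point at colatitude 0.3 falls in the cell of the north pole (index 4).
theta = 0.3
point = (math.sin(theta), 0.0, math.cos(theta))
cell = voronoi_cell_index(point, centres)

# The point farthest from all centres is (1, 1, 1)/sqrt(3); its distance to
# the nearest centre is the covering radius arccos(1/sqrt(3)).
corner = (1.0 / math.sqrt(3.0),) * 3
covering_radius = great_circle_distance(
    corner, centres[voronoi_cell_index(corner, centres)])
```

For these centres the pairwise distances are at least $\pi/2$ while the covering radius is about 0.955, so any $\varepsilon$ between the two values gives the inclusions $B(x_i, \varepsilon/2) \subset V(x_i) \subset B(x_i, \varepsilon)$ recalled above.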

For the construction of needlets, we should first define $\mathcal{K}_l$ as the space
of the restrictions to the sphere $S^2$ of polynomials of degree less than $l$. The next
ingredient is the set of cubature points and cubature weights; indeed, it is now
a standard result (see for instance [29]) that for all $j \in \mathbb{N}$, there exists a finite
subset $X_j$ of $S^2$ and positive real numbers $\lambda_{jk} > 0$, indexed by the elements of
$X_j$, such that

$$\forall f \in \mathcal{K}_l, \quad \int_{S^2} f(x)\,dx = \sum_{\xi_{jk} \in X_j} \lambda_{jk} f(\xi_{jk}). \qquad (3)$$

Given a fixed $B > 1$, we shall denote by $\{\xi_{jk}\}$ the cubature points corresponding
to the space $\mathcal{K}_{[2B^{j+1}]}$, where $[.]$ represents as usual integer part. It is known that
$\{X_j\}_{j=0}^{\infty}$ can be taken s.t. the cubature points for each $j$ are almost uniformly
$\varepsilon_j$-distributed with $\varepsilon_j := \kappa B^{-j}$, and the coefficients $\{\lambda_{jk}\}$ are such that $\lambda_{jk} \approx
c B^{-2j}$, $card\{X_j\} \approx B^{2j}$.
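
The exactness property (3) can be illustrated on a toy rule. The sketch below is not the cubature used for needlets: it assumes, for illustration only, the six octahedron vertices with equal weights $4\pi/6$, a rule which integrates exactly all polynomials of degree at most 3 and fails at degree 4.

```python
import math

# Toy cubature rule (assumption): octahedron vertices, equal weights.
points = [(1.0, 0.0, 0.0), (-1.0, 0.0, 0.0),
          (0.0, 1.0, 0.0), (0.0, -1.0, 0.0),
          (0.0, 0.0, 1.0), (0.0, 0.0, -1.0)]
weights = [4.0 * math.pi / len(points)] * len(points)

def cubature(f):
    """Approximate the integral of f over the sphere by sum_k lambda_k f(xi_k)."""
    return sum(w * f(*p) for w, p in zip(weights, points))

area = cubature(lambda x, y, z: 1.0)        # exact value: 4*pi
second = cubature(lambda x, y, z: x * x)    # exact value: 4*pi/3
mixed = cubature(lambda x, y, z: x * y)     # exact value: 0
fourth = cubature(lambda x, y, z: x ** 4)   # exact value 4*pi/5, rule gives 4*pi/3
```

The first three integrals are reproduced exactly, whereas the fourth is not; a needlet construction at level $j$ requires a rule which remains exact up to a degree growing like $B^{j}$.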

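Now let $\phi$ be a $C^{\infty}$ function supported in $|\xi| \le 1$, such that $0 \le \phi(\xi) \le 1$
and $\phi(\xi) = 1$ if $|\xi| \le B^{-1}$, $B > 1$. Following again [30, 31], we define

$$b^2(\xi) = \phi\Big(\frac{\xi}{B}\Big) - \phi(\xi) \ge 0 \quad \text{so that} \quad \forall |\xi| \ge 1, \ \sum_{j=0}^{\infty} b^2\Big(\frac{\xi}{B^j}\Big) = 1. \qquad (4)$$

It is immediate to verify that $b(\xi) \neq 0$ only if $\frac{1}{B} \le |\xi| \le B$. The needlets frame
$\{\varphi_{jk}\}$ is then constructed as

$$\varphi_{jk}(x) := \sqrt{\lambda_{jk}} \sum_{l} b\Big(\frac{l}{B^j}\Big) \sum_{m=-l}^{l} Y_{lm}(\xi_{jk}) \overline{Y}_{lm}(x). \qquad (5)$$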
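
The window function $b$ entering the needlets can be checked numerically. The sketch below builds one admissible $\phi$ from the standard smooth bump $t \mapsto e^{-1/t}$ (a choice made here for illustration; the construction above only requires the listed properties of $\phi$) and evaluates the sum over $j$ of $b^2(\xi/B^j)$. All function names are hypothetical.

```python
import math

def _f(t):
    """Smooth function vanishing, with all derivatives, for t <= 0."""
    return math.exp(-1.0 / t) if t > 0 else 0.0

def smooth_step(t):
    """C-infinity step: 0 for t <= 0, 1 for t >= 1, increasing in between."""
    t = min(1.0, max(0.0, t))
    return _f(t) / (_f(t) + _f(1.0 - t))

def phi(xi, B):
    """Cut-off: equal to 1 for |xi| <= 1/B and to 0 for |xi| >= 1."""
    t = (abs(xi) - 1.0 / B) / (1.0 - 1.0 / B)
    return 1.0 - smooth_step(t)

def b_squared(xi, B):
    """b^2(xi) = phi(xi / B) - phi(xi), supported in 1/B <= |xi| <= B."""
    return phi(xi / B, B) - phi(xi, B)

def partition_sum(xi, B, j_max):
    """Partial sum over j = 0, ..., j_max of b^2(xi / B^j)."""
    return sum(b_squared(xi / B ** j, B) for j in range(j_max + 1))

B = 1.3
sums = [partition_sum(xi, B, 25) for xi in (1.0, 2.5, 17.0, 50.0)]
```

The sum telescopes to $\phi(\xi/B^{j_{max}+1}) - \phi(\xi)$, which equals 1 as soon as $1 \le \xi \le B^{j_{max}}$; this is why each entry of `sums` is equal to one up to rounding.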
The main localization property of $\{\varphi_{jk}\}$ is established in [-'Ul], where it is shown
that for any $M \in \mathbb{N}$ there exists a constant $c_M > 0$ s.t., for every $\xi \in S^2$,

$$|\varphi_{jk}(\xi)| \le \frac{c_M B^j}{(1 + B^j \arccos\langle \xi_{jk}, \xi \rangle)^M} \quad \text{uniformly in } (j, k).$$



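More explicitly, needlets are almost exponentially localized around any cubature
point, which motivates their name. In the stochastic case, the (random) spherical
needlet coefficients are then defined as

$$\beta_{jk} := \int_{S^2} T(x) \varphi_{jk}(x)\,dx = \sqrt{\lambda_{jk}} \sum_{l=B^{j-1}}^{B^{j+1}} b\Big(\frac{l}{B^j}\Big) \sum_{m=-l}^{l} a_{lm} Y_{lm}(\xi_{jk}). \qquad (6)$$

We have immediately

$$\sum_{k} \beta_{jk} \sqrt{\lambda_{jk}} = 0, \qquad (7)$$

i.e., the (weighted) sample mean of the needlets coefficients is identically zero
at all levels $j$. The proof is trivial, because

$$\sum_{k} \beta_{jk} \sqrt{\lambda_{jk}} = \sum_{l=B^{j-1}}^{B^{j+1}} b\Big(\frac{l}{B^j}\Big) \sum_{m=-l}^{l} a_{lm} \sum_{k} \lambda_{jk} Y_{lm}(\xi_{jk}) = \sum_{l=B^{j-1}}^{B^{j+1}} b\Big(\frac{l}{B^j}\Big) \sum_{m=-l}^{l} a_{lm} \int_{S^2} Y_{lm}(x)\,dx = 0.$$

The variance of the needlets coefficients is given by

$$E\beta_{jk}^2 = \lambda_{jk} \sum_{l=B^{j-1}}^{B^{j+1}} b^2\Big(\frac{l}{B^j}\Big) C_l \frac{2l+1}{4\pi} P_l(\cos 0) = \lambda_{jk} \sum_{l=B^{j-1}}^{B^{j+1}} b^2\Big(\frac{l}{B^j}\Big) \frac{2l+1}{4\pi} C_l =: \sigma^2_{jk}.$$
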

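Note that we have $\sigma^2_{jk} \approx \sigma^2_j$ uniformly over $k$, where

$$\sigma^2_j := \frac{1}{N_j} \sum_{k} \sigma^2_{jk} = \frac{1}{N_j} \sum_{l=B^{j-1}}^{B^{j+1}} b^2\Big(\frac{l}{B^j}\Big) (2l+1) C_l, \quad N_j := card\{X_j\}.$$

From now on, we shall typically focus on the normalized needlets coefficients,
defined as $\hat{\beta}_{jk} := \beta_{jk}/\sigma_j$.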

To investigate the correlation, we introduce now the same, mild regularity
conditions on the angular power spectrum $C_l$ of the random field $T(x)$ as in
[5, 6].

Condition A The random field $T(x)$ is Gaussian and isotropic with angular
power spectrum such that, for all $B > 1$, there exist $\alpha > 2$ and
a sequence of functions $\{g_j(\cdot)\}$ such that

$$C_l = l^{-\alpha} g_j\Big(\frac{l}{B^j}\Big) > 0, \quad \text{for } B^{j-1} < l < B^{j+1}, \ j = 1, 2, \dots \qquad (8)$$

where

$$c_0^{-1} \le g_j \le c_0 \ \text{for all } j \in \mathbb{N}, \ \text{and} \ \sup_{j} \sup_{B^{-1} \le u \le B} \Big| \frac{d^r}{du^r} g_j(u) \Big| \le c_r \ \text{some } c_0, c_1, \dots, c_M > 0, \ M \in \mathbb{N}.$$



Condition A entails a weak smoothness requirement on the behaviour of the
angular power spectrum, which is trivially satisfied by some cosmologically relevant
models (where the angular power spectrum usually behaves as an inverse
polynomial, see again [ I "1], pp. 243-244). For instance, considering positive constants
$d_k$, $k = 0, \dots, p$, $p > 2$, for all $B > 1$ Condition A holds for

$$C_l = \frac{1}{\sum_{k=0}^{p} d_k l^k} = \frac{1}{d_0 + d_1 l + \dots + d_p l^p} = l^{-p} \frac{(l/B^j)^p}{\sum_{k=0}^{p} d_k B^{jk-jp} (l/B^j)^k} = l^{-p} g_j\Big(\frac{l}{B^j}\Big),$$

for $B^{j-1} < l < B^{j+1}$, $j = 1, 2, \dots$, where

$$g_j(u) := \frac{u^p}{\sum_{k=0}^{p} d_k B^{jk-jp} u^k}.$$



Under Condition A a crucial and rather unexpected property of the random
needlets coefficients is established in [6], namely the correlation bound

$$|Corr(\beta_{jk}, \beta_{jk'})| \le \frac{C_M}{(1 + B^j d(\xi_{jk}, \xi_{jk'}))^M}, \qquad (9)$$

where $d(\xi_{jk}, \xi_{jk'}) = \arccos(\langle \xi_{jk}, \xi_{jk'} \rangle)$. In words, (9) is stating that as the
frequency increases, spherical needlets coefficients are asymptotically uncorrelated
(and hence, in the Gaussian case, independent), for any given angular distance.
This property is of course of the greatest importance when investigating the
asymptotic behaviour of statistical procedures: in some sense, it states that it
is possible to derive an infinitely growing array of asymptotically independent
"observations" (the needlets coefficients) out of a single realization of a continuous
random field on a compact domain. It should be stressed that this property
is not by any means a consequence of the localization properties of the needlets
frame. As a counterexample, it is easy to construct spherical frames having
bounded support in real space, whereas the corresponding random coefficients
are not at all uncorrelated (recall the angular correlation function can be taken
to be bounded from below at any distance on the sphere).
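
The decay of the correlation with the frequency can be explored numerically. The following sketch evaluates the correlation between two needlet coefficients at the same level $j$ whose cubature points are at angular distance $\theta$, namely the ratio of $\sum_l b^2(l/B^j)(2l+1)C_l P_l(\cos\theta)$ to its value at $\theta = 0$ (the cubature weights cancel). It assumes, for illustration only, the power law $C_l = l^{-\alpha}$ and a window $b$ built from the bump $e^{-1/t}$; the function names are hypothetical.

```python
import math

def _f(t):
    return math.exp(-1.0 / t) if t > 0 else 0.0

def _smooth_step(t):
    t = min(1.0, max(0.0, t))
    return _f(t) / (_f(t) + _f(1.0 - t))

def b_squared(xi, B):
    """Assumed window: b^2(xi) = phi(xi / B) - phi(xi), phi a smooth cut-off."""
    def phi(x):
        return 1.0 - _smooth_step((abs(x) - 1.0 / B) / (1.0 - 1.0 / B))
    return phi(xi / B) - phi(xi)

def legendre_values(x, l_max):
    """P_0(x), ..., P_{l_max}(x) by the three-term recurrence."""
    p = [1.0, x]
    for l in range(1, l_max):
        p.append(((2 * l + 1) * x * p[l] - l * p[l - 1]) / (l + 1))
    return p[:l_max + 1]

def needlet_correlation(theta, j, B=2.0, alpha=2.0):
    """Correlation of two level-j coefficients at angular distance theta."""
    l_max = int(B ** (j + 1))
    p = legendre_values(math.cos(theta), l_max)
    num = den = 0.0
    for l in range(1, l_max + 1):
        # Weight b^2(l / B^j) (2l + 1) C_l with the assumed C_l = l^(-alpha).
        w = b_squared(l / B ** j, B) * (2 * l + 1) * l ** (-alpha)
        num += w * p[l]
        den += w
    return num / den

at_zero = needlet_correlation(0.0, 3)
high_frequency = needlet_correlation(0.5, 6)
```

At $\theta = 0$ the ratio is one by construction, whereas at a fixed positive angle its modulus becomes small as $j$ grows, in line with (9).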

We recall briefly two further results that we shall exploit often in the sequel;
more precisely, ([6], Lemma 10) we note that for $M > 2$, $j \in \mathbb{N}$, $\forall k, k'$,

Also ([31], Lemma 4.8), for some $C'_M$ depending only on $M$ we have the
inequality



1 < C',j 



(11) 

From the computational point of view, we should stress that needlets are
not only feasible, but indeed extremely convenient. The implementation can
be performed with a minimal effort by means of standard packages for the
analysis of spherical random fields such as HEALPix or GLESP ([18] and [16]),
see for details [26], where plots and numerical evidence on localization and
uncorrelation are also provided.



2.2. Diagram Formula 

To complete our background, we need a quick review of the diagram formula.
This is material which can now be found at a textbook level (see for instance
[34]); nevertheless, we need a brief overview to fix notation. Denote by $H_q$ the
$q$-th order Hermite polynomials, defined as

$$H_q(u) = (-1)^q e^{u^2/2} \frac{d^q}{du^q} e^{-u^2/2}.$$
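
The definition above yields $H_0(u) = 1$, $H_1(u) = u$ and the recurrence $H_{q+1}(u) = u H_q(u) - q H_{q-1}(u)$, which is convenient for numerical evaluation. A minimal sketch (the function name is hypothetical):

```python
def hermite(q, u):
    """H_q(u), evaluated through H_{n+1}(u) = u H_n(u) - n H_{n-1}(u)."""
    if q == 0:
        return 1.0
    h_prev, h = 1.0, u
    for n in range(1, q):
        h_prev, h = h, u * h - n * h_prev
    return h

# H_2(u) = u^2 - 1, H_3(u) = u^3 - 3u, H_4(u) = u^4 - 6u^2 + 3.
values = (hermite(2, 1.5), hermite(3, 2.0), hermite(4, 1.0))
```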

We now introduce diagrams, which are basically mnemonic devices for computing
the moments and cumulants of polynomial forms in Gaussian random
variables. Our notation is the same as for instance in [23, 24]. Let $p$ and $l_j$,
$j = 1, \dots, p$ be given integers. A diagram $\gamma$ of order $(l_1, \dots, l_p)$ is a set of points
$\{(j, l) : 1 \le j \le p, 1 \le l \le l_j\}$ called vertices, viewed as a table $W = \overrightarrow{l_1} \otimes \cdots \otimes \overrightarrow{l_p}$
and a partition of these points into pairs

$$\{((j, l), (k, s)) : 1 \le j \le k \le p; \ 1 \le l \le l_j, \ 1 \le s \le l_k\},$$

called edges. We denote by $\Gamma(W)$ the set of diagrams of order $(l_1, \dots, l_p)$. If the
order is $l_1 = \cdots = l_p = q$, for simplicity, we also write $\Gamma(p, q)$ instead of $\Gamma(W)$.
We say that:

a) A diagram has a flat edge if there is at least one pair $\{(i, j)(i', j')\}$ such
that $i = i'$; we write $\Gamma_F$ for the set of diagrams that have at least one flat edge,
and $\Gamma_{\overline{F}}$ otherwise.

b) A diagram is connected if it is not possible to partition the rows $\overrightarrow{l_1} \cdots \overrightarrow{l_p}$ of
the table $W$ into two parts, i.e. one cannot find a partition $K_1 \cup K_2 = \{1, \dots, p\}$
such that, for each member $V_k$ of the set of edges $(V_1, \dots, V_r)$ in a diagram $\gamma$, either
$V_k \in \cup_{j \in K_1} \overrightarrow{l_j}$, or $V_k \in \cup_{j \in K_2} \overrightarrow{l_j}$ holds; we write $\Gamma_C$ for connected diagrams, and
$\Gamma_{\overline{C}}$ otherwise.

c) A diagram is paired if, considering any two sets of edges $\{(i_1, j_1)(i_2, j_2)\}$
$\{(i_3, j_3)(i_4, j_4)\}$, then $i_1 = i_3$ implies $i_2 = i_4$; in words, the rows are completely
coupled two by two. We write $\Gamma_P$ for the set of paired diagrams,
and $\Gamma_{\overline{P}}$ otherwise.

Proposition 1. (Diagram Formula) Let $(z_1, \dots, z_p)$ be a centered Gaussian
vector, and let $\gamma_{ij} = E[z_i z_j]$, $i, j = 1, \dots, p$ be their covariances. Let $H_{l_1}, \dots, H_{l_p}$ be
Hermite polynomials of degree $l_1, \dots, l_p$ respectively. Let $L$ be a table consisting
of $p$ rows $l_1, \dots, l_p$, where $l_j$ is the order of the Hermite polynomial in the variable
$z_j$. Then
E[UP^^,H,^{z,)] 



Cum[Hi^{zi), . . .,Hi^{zp)) 

where, for each diagram $G$, $\eta_{ij}(G)$ is the number of edges between rows $l_i, l_j$ and
$Cum(H_{l_1}(z_1), \dots, H_{l_p}(z_p))$ denotes the $p$-th order cumulant.
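
In the special case where all the Hermite polynomials have order one, the moment of a product of centered Gaussian variables is a sum over all partitions of the factors into pairs, each pair contributing the corresponding covariance. The sketch below enumerates such pairings; it is a simplified illustration of the combinatorics of diagrams, not an implementation of the general formula, and the function names are hypothetical.

```python
def pairings(items):
    """Yield every partition of the list `items` into unordered pairs."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail

def gaussian_moment(cov, indices):
    """E[z_{i_1} ... z_{i_n}] for a centered Gaussian vector with covariance cov."""
    total = 0.0
    for pairing in pairings(list(indices)):
        term = 1.0
        for a, b in pairing:
            term *= cov[a][b]
        total += term
    return total

def has_flat_edge(pairing):
    """True if some edge joins two vertices (row, column) of the same row."""
    return any(a[0] == b[0] for a, b in pairing)

# Two correlated standard Gaussian variables: E[z_0^2 z_1^2] = 1 + 2 rho^2.
cov = [[1.0, 0.5], [0.5, 1.0]]
moment = gaussian_moment(cov, [0, 0, 1, 1])

# A table with 2 rows and 2 columns: 3 pairings, exactly one with flat edges.
table = [(row, col) for row in range(2) for col in range(2)]
flat_count = sum(has_flat_edge(p) for p in pairings(table))
```

A table with four rows and three columns, as needed for fourth order moments of a cubic statistic, has 12 vertices and hence $11 \cdot 9 \cdot 7 \cdot 5 \cdot 3 = 10395$ pairings before any classification into connected, paired or flat diagrams.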

We have now all the preliminary material to define our needlets bispectrum
on $S^2$, as explained in the following Section.



Err G 



Ger{h....,ip) 

E 



Err 'fc(G) 



3. A Central Limit Theorem for the Needlets Bispectrum 
3.1. The needlets bispectrum 

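As mentioned in the introduction, the recent literature suggests that the most
powerful statistic to search for non-Gaussianity in fully observed spherical random
fields is the (normalized) angular bispectrum, defined as

$$I_{l_1 l_2 l_3} := \sum_{m_1, m_2, m_3} \begin{pmatrix} l_1 & l_2 & l_3 \\ m_1 & m_2 & m_3 \end{pmatrix} \frac{a_{l_1 m_1} a_{l_2 m_2} a_{l_3 m_3}}{\sqrt{C_{l_1} C_{l_2} C_{l_3}}},$$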

where the symbol in brackets represents the so-called Wigner's 3j coefficients,
which are meant to ensure the statistic is rotationally invariant. Wigner's 3j
coefficients arise in many different instances, especially in the quantum theory
of angular momentum (see [36], where explicit expressions are also provided).
Up to normalization factors, they are equivalent to the so-called Clebsch-Gordan
coefficients, which play an important role in representation theory for
the group of rotations $SO(3)$, see [2'>] for a much more detailed discussion and
probabilistic applications. Following [20, 37], an alternative definition of the
(normalized) bispectrum can be considered, namely

$$\widetilde{I}_{l_1 l_2 l_3} := \int_{S^2} \frac{T_{l_1}(x) T_{l_2}(x) T_{l_3}(x)}{\sqrt{Var(T_{l_1}(x)) Var(T_{l_2}(x)) Var(T_{l_3}(x))}}\,dx, \qquad (12)$$

where

$$T_l(x) := L_l T = \int_{S^2} T(y) \sum_{m} Y_{lm}(x) \overline{Y}_{lm}(y)\,dy = \sum_{m} a_{lm} Y_{lm}(x),$$

i.e. we focus on the Fourier projections of the random fields at multipoles
$(l_1, l_2, l_3)$. Both versions of the bispectrum have been shown to be extremely
powerful against non-Gaussian alternatives; indeed there is now a widespread
consensus that they make up the most efficient statistical procedures to search
for non-Gaussianity, at least in the presence of fully observed spherical maps
(see for instance [11, 21, 28]).

On the other hand, it is also well-known that the performance of Fourier
methods in general, and the bispectrum in particular, deteriorates quite clearly
in the presence of missing observations/partially observed maps ([20, 22, 11]).
A natural idea is then to exploit the localization properties of the needlets
in harmonic domain, together with their robustness in the presence of gaps,
in order to devise a statistic which might mimic the positive features of the
bispectrum, at the same time coping with the difficulties brought in by missing
observations.

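To this aim, we shall consider the (normalized) needlets bispectrum

$$I_{j_1 j_2 j_3} := \sum_{k_1, k_2, k_3} \hat{\beta}_{j_1 k_1} \hat{\beta}_{j_2 k_2} \hat{\beta}_{j_3 k_3} \, \delta_{j_1 j_2 j_3}(k_1, k_2, k_3) \, h_{j_1 j_2 j_3}(k_1, k_2, k_3), \qquad (13)$$

where

$$\delta_{j_1 j_2 j_3}(k_1, k_2, k_3) = I(\xi_{j_3 k_3} \in V_{j_2 k_2}) \, I(\xi_{j_2 k_2} \in V_{j_1 k_1}),$$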



#{k2:k2ev,,,,nx,,}' J3^ 
^ Aj.,^., , otherwise 

where I(.) denotes the indicator function and Vjk is the Voronoi Cell that corre- 
sponds to ^jk; note that for j2 = js we have hj^j^j^ « It is immediate to 
see that (13) can be seen as a natural development of (12), where the convolution
with the orthonormal projection operator $L_l$ is replaced by the (discretized)
convolution with the frame operator projection $\sqrt{\lambda_{jk}} \sum_l b(l/B^j) L_l$. Of course,
in practice (12) is unfeasible and requires discretization to be implemented. In
words, we are considering a version of the bispectrum where the exact identification
of the multipoles is blurred by a form of suitable smoothing, with the
purpose of a better robustness against missing observations.

The summation convention in (13) needs some further discussion. In practice,
for applications the Voronoi tessellation is chosen in such a way as to be nested
across different scales (this is the case, for instance, for HEALPix [18], which
is the standard package for CMB applications). Under such circumstances, our
procedure can be described more explicitly as follows: we take

$$I_{j_1 j_2 j_3} = \sum_{k_3} \hat{\beta}_{j_1 k_1} \hat{\beta}_{j_2 k_2} \hat{\beta}_{j_3 k_3} \, h_{j_1 j_2 j_3}(k_1, k_2, k_3), \qquad (14)$$

where $k_2 = k_2(k_3)$ is the (unique) value of $k_2$ such that $\xi_{j_3 k_3} \in V_{j_2 k_2}$, and
$k_1 = k_1(k_3)$ is the (unique) value of $k_1$ such that $\xi_{j_2 k_2}, \xi_{j_3 k_3} \in V_{j_1 k_1}$. In other
words, the "finest grid" $X_{j_3}$ is the one which leads the summation, whereas
smaller frequency terms are identified with those centres whose corresponding
Voronoi cells include the points being summed. Note, however, that in the general
non-nested case the centre of $V_{j_1 k_1}$ need not belong to $X_{j_3}$. In the sequel,
for notational simplicity we write $k_1, k_2$ rather than $k_1(k_3), k_2(k_3)$, when no
ambiguity is possible.
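
The summation led by the finest grid can be sketched as follows. This is only an illustration under explicit assumptions: the grids are nested, the maps `parent32` and `parent21` (hypothetical names) send each pixel index to the index of the coarser cell containing it, and the weights `h` are supplied by the user, one per pixel of the finest grid. None of these names comes from an actual package.

```python
def needlets_bispectrum_nested(beta1, beta2, beta3, parent21, parent32, h):
    """Sum over the finest grid of beta1[k1] * beta2[k2] * beta3[k3] * h[k3],

    where k2 = parent32[k3] and k1 = parent21[k2] are the coarser cells
    containing the pixel k3 (nested grids are assumed).
    """
    total = 0.0
    for k3, b3 in enumerate(beta3):
        k2 = parent32[k3]
        k1 = parent21[k2]
        total += beta1[k1] * beta2[k2] * b3 * h[k3]
    return total

# Toy example: 2 coarse cells, each split in 2, each split again in 2.
beta1 = [1.0, -1.0]
beta2 = [2.0, 0.5, 1.0, 3.0]
beta3 = [1.0] * 8
parent21 = [0, 0, 1, 1]
parent32 = [0, 0, 1, 1, 2, 2, 3, 3]
h = [0.5] * 8
statistic = needlets_bispectrum_nested(beta1, beta2, beta3, parent21, parent32, h)
```

In a pixelization with a hierarchical indexing, the parent maps reduce to integer divisions of the pixel index, so that the whole statistic costs a single pass over the finest grid.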

To investigate the asymptotic behaviour of the needlets bispectrum, we shall
make extensive use of the Diagram Formula which was introduced in the
previous section. A crucial point, of course, is the determination of the frequencies
where the needlets bispectrum is evaluated. We distinguish three cases, namely:

• Case 1) $j_1 + 1 < j_2 < j_3 - 1$;

• Case 2) $j_1 + 1 < j_2 = j_3$ or $j_1 = j_2 < j_3 - 1$;

• Case 3) $j_1 = j_2 = j_3$.
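
For bookkeeping purposes, the three configurations can be encoded in a small helper; the function name and the convention of returning `None` for triples not covered by any case (e.g. adjacent frequencies) are choices made here for illustration.

```python
def classify_configuration(j1, j2, j3):
    """Return 1, 2 or 3 for the three cases, or None if no case applies."""
    if not (j1 <= j2 <= j3):
        raise ValueError("frequencies must satisfy j1 <= j2 <= j3")
    if j1 == j2 == j3:
        return 3  # equilateral configuration
    if j1 + 1 < j2 and j2 < j3 - 1:
        return 1  # all frequencies at least two steps apart
    if (j1 + 1 < j2 and j2 == j3) or (j1 == j2 and j2 < j3 - 1):
        return 2  # two equal frequencies, the third at least two steps apart
    return None

cases = [classify_configuration(*j) for j in
         [(3, 5, 7), (3, 6, 6), (4, 4, 6), (5, 5, 5), (3, 4, 5)]]
```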

Case 1) corresponds to the situation where all three frequencies are different.
Case 2) is basically the "squeezed" or collapsed configuration which is considered
by [■), 2.H], and many other cosmological references; in words, one frequency
is (much) smaller than the other two. It has been widely argued in the physical
literature that this configuration corresponds to the highest power region for
so-called local models of non-Gaussianity. Case 3) corresponds to so-called
equilateral configurations; this case, however, can be largely investigated by means
of results in [6] and we report it only for completeness, omitting many details
in the proof. It should be noted that for cases 1) and 2) we focus on frequencies
that are at least two steps apart, in order to exploit the semiorthogonality
properties of the needlets systems. Relaxing this assumption implies no new ideas
and would only make the paper notationally more complicated.

In each of the three cases we have trivially $E I_{j_1 j_2 j_3} = 0$. We now focus on
the asymptotic behaviour; here, asymptotics must be understood in the high
resolution sense, i.e. we focus on a single realization of an isotropic random field,
and we investigate the behaviour of our statistics at higher and higher bands.
The first task is to ensure the statistics are non-degenerate and do not exhibit
an explosive behaviour; this is the aim of the next Lemma. While the bound
from above is quite straightforward, the lower bound is much more complicated
and settles a question which was left open in [5], where the lower limit was
simply assumed to be strictly larger than zero even for the simple case where
$j_1 = j_2 = j_3$. As is evident from the proof, the integer $K$ depends on the
choice of cubature points and of kernel function $b(.)$; more explicit expressions
can be found below. Note also that unless the three bands are equal, condition
b) cannot be satisfied for $B = 2$; indeed in CMB applications values of order
$B \simeq 1.2, 1.3$ are likely to be favoured.

Lemma 1. Under Condition A, as $j_1 \to \infty$,
a) For all $j_1 \le j_2 \le j_3$, $E I^2_{j_1 j_2 j_3} = O(1)$.
Also, there exists a positive integer $K$ such that

b) For $\{j_1 = j_2 = j_3\}$, or $\{j_1 + K < j_2, \ B^{j_1} + B^{j_2} > B^{j_3}\}$, $\{E I^2_{j_1 j_2 j_3}\}^{-1} =
O(1)$.

The bounds are uniform with respect to $j_1, j_2$.

Proof. (a) In the sequel, we shall use the fact that the set of the cubature points
of polynomial spaces with degree less than $B^j$ is a net; we also define

$\rho := \max_{j,k}\{B^{2j} \lambda_{j,k}\}$. The proofs of all three cases are similar; we shall focus
on case 2) $j_1 + 1 < j_2 = j_3$.



Q2]2-2ji ^f^n kiPjik[ (E/3j2 k2 P32 fc^ ) ^ VAjifci 



#{fc^,A:^e V.^fc^nA-.J- 
Since Ajfc ~ for every ^^k, and #{fc2,fc2 G Vj^fei n A-^J ~ S^J^^^j^ ^.j^g 
sum can be readily bounded by 

'32k'^ 



mik, K i^n k2 %2 k')l < ^ E (^^^-^ Z^^-: 

0(1) as j2 ^ 00, 



in view of (10). This completes the proof of part a). 

(b) The proof of the lower bound on the variance is considerably more delicate.
We recall the correlation of the needlets coefficients is provided by

$$E\beta_{jk}\beta_{jk'} = \sqrt{\lambda_{jk}\lambda_{jk'}} \sum_{l} b^2\Big(\frac{l}{B^j}\Big) \frac{2l+1}{4\pi} C_l P_l(\cos d),$$

where $d = \arccos\langle \xi_{jk}, \xi_{jk'} \rangle$. The idea of our argument is to replace the needlets
coefficients in the coarsest grid $X_{j_1}$ by coefficients with the same resolution but
evaluated over the pixels $X_{j_2} \cap V_{j_1}(\xi_{j_1 k_1})$. This will allow us to circumvent the
fact that the cubature weights at the smaller frequencies stay constant over a
Voronoi cell, whereas those corresponding to the highest $j$'s may vary. We shall
hence consider


B^ + i I 



Im i^j' k' ) ■ 



For fixed {j',k'), j' < j, l3jk{£,j'k') varies over the pixels in Xj n Vj>{£,j/k')- 
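Let us now recall a few properties of Legendre polynomials, that we shall use
extensively soon (see [ Ui] for details); we have

$$\sup_{\theta \in [0, \pi]} P_l(\cos\theta) = P_l(\cos 0) = 1, \quad \text{and} \quad \sup_{\theta \in [0, \pi]} \Big| \frac{d}{d\theta} P_l(\cos\theta) \Big| \le 3l.$$

As a consequence, for any positive $\varepsilon < 1$, there exists a $\delta > 0$, s.t. if $0 \le \theta \le
\delta \le \varepsilon/(3l)$, then

$$|P_l(\cos(\theta_0 + \theta)) - P_l(\cos\theta_0)| \le 3l\theta \le \varepsilon,$$

because

$$|\cos(\theta_0 + \theta) - \cos\theta_0| = \Big| 2 \sin\frac{2\theta_0 + \theta}{2} \sin\frac{\theta}{2} \Big| \le \theta.$$
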

Now fix an integer K such that K > logg Qk/e; if we let {£,j,k}ji; be the cu- 
bature points of polynomial space with degree less than 5-'+^+^, (note that 
we can assume all of these sets are kP^-'— nets), then for any j2 > jijCj2fe2 G 

KOlfel'^ilfc'i) ~ (02fc27?j2fci)| = \d{£,nkl,S.jlk[) - d{£.j2k2,S.j2k'J\ 

It follows that 

m.k.Pnk', - nPUA^j2k2)PU'S^j2kO 

<V^;^ E ^H-^) 

I I 

X C/ ^ ^ E ^™(^J2fe2)^/m(?j2fci) 

m— — / 

Pi {{^jiki , ink[ )) - Pi {{£.]2k2 1 ^2^ )) 



-—I 



= \/XjkXjk' E 
< ey^XjkXjk' E 

i=_BJi-i 



I \^2l + l 



Gi- 



An ^1 



(15) 
where $C$ is a constant, $C = C(\kappa)$ (see [-'>]). We are now in the position to
establish our lower bound. By simple algebraic manipulations, we have



E/^ ■ ■ 



{EP,,k,PnK - ^f3nkS^T3k3)Plk'S^,3kO) 



E 



k3.k'^£Xj^ 



X k2 d]2 k'^ IE/3j3 fc3 /i,-3 k'^ \J\j3k3 \/^]3k'^ 

^ (e,-3^ ) m.^.p^.k', - w^^, (a^/oa (^,3^ )) 

k3,k!^eX,^ ^'^Oi^^hJ 

X IE/3j3fc3 A/3fc^V ^i3fc3-\/ ^33k'^ ■ 

In view of (15), (10) and standard manipulations, the second and third components
can be made arbitrarily small. To bound the first component, we recall
first two facts involving spherical harmonics (see again [36, 23] for details),
namely the so-called Gaunt integral

$$\int_{S^2} Y_{l_1 m_1}(x) Y_{l_2 m_2}(x) Y_{l_3 m_3}(x)\,dx = \sqrt{\frac{(2l_1+1)(2l_2+1)(2l_3+1)}{4\pi}} \begin{pmatrix} l_1 & l_2 & l_3 \\ m_1 & m_2 & m_3 \end{pmatrix} \begin{pmatrix} l_1 & l_2 & l_3 \\ 0 & 0 & 0 \end{pmatrix},$$

where for the Wigner's 3j coefficients introduced on the right-hand side we recall
the properties

$$\sum_{m_1, m_2, m_3} \begin{pmatrix} l_1 & l_2 & l_3 \\ m_1 & m_2 & m_3 \end{pmatrix}^2 \equiv 1, \qquad (16)$$


h h h \ ^ 



Q Q QJ >j2l{h+l2>k,h+l2 + h = even}. (17) 

Using the fact that $\{\lambda_{jk}\}$ are the cubature weights corresponding to the space
$\mathcal{K}_{[2B^{j+1}]}$, easy manipulations yield

Ji 32 33 I1I2I3 mivi2m3 \ / \ / \ / 

^ ^ ] Ylimi (Cj3fc3 )Yl2m2 (Cj3fc3 )-^3ni3 (Cj3fc3)'^j3fc3 



k3 

^ E ^hmi i^33k!^)Yl2Tn2 {^]3k;^)Ylsm3 i^33k'J^33k'^ 



Ji 32 J3 hhh rnim2m3 \ / \ / \ / 

J2 J3 (j;2;3 \ / \ / \ 

(2;i + i)(2Z2 + i)(2;3 + i) 



X Ci^Ci^Ci 



4tt 



X 

mi 7712^^3 



'1 ^2 h \ f h h h 



mi 1712 y Y 
and using o-| « B^^Cbj , B^' ~ h^i = 1,2,3 we get 



^ ^ ^ ^ {2h + l){'2l2 + mh + l) ( h h h 



47r \^ 

'1 \ j2 / ^2 \ ,2 / h 



In the previous argument, we have taken c := infj^j^j, Cj^j^j^^, where 

'1 \ l2 / '2 \ , 2 / '3 



'l+'2>i3 



5ii / \ SJ2 / y _ei3 



It is simple to show that c > 0. Indeed, under (4), we have 
1 

[0,2]^n{lll<1i2<M3} 

> - / dui dU2 / du3{b^iui)b^{u2)b^iu3)} 

^ Jl-So Jui Jl+Sa 

where 60 < 1/4, and b{x) > 1/2 for any x £ [1 — (5o, 1 + (5o] , cq = 

The same argument as before could be used to establish lower bounds when
$j_1 = j_2 < j_3$ or $j_1 = j_2 = j_3$. To conclude our proof, we consider the case when
$j_1 < j_2 = j_3$. We obtain
2



fei fcaGVjjtjnA'jj 

the two summands corresponding to the cases where $k_2, k_2'$ belong to the same
or to different Voronoi cells, respectively. The first part is equal to

while the second part is equal to

c'{^.p)B^^ E E + E E I 

4V„fci,v^-^,,)=o fe^6V^-^,,nA',3 d(v„fci,v^^,,)>o ^ev^^,,nA',, 



ki k2&Vj-^^kif~^^j2 



E E (e E 



^^'(-'/')^E^"-" 

fcl 

, C'in.p)^^ Cm 



EE 



M 



< C {k, p) B-fe-Ji) + C' {k, p, Cm) I ..^''1''" 



= C{k,P,Cm)B-(J--^^^) 

By taking j2 - ji > K > max {[log^ {2K/e}], ^01/02} , where a = C{k), C2 = 
C {k,p,Cm) , and e < ^c/Cm, the argument is completed. The proof for case
(3) is similar (actually slightly easier). □
The following weak convergence theorem is the main result of this Section.
We stress that the statement could be easily extended to a multivariate Central
Limit Theorem; however, because this extension would not entail any substantial
novelty, while making the notation much more cumbersome, we prefer
to stick to the univariate case.

Theorem 2. Let $T(x)$ be a zero-mean, mean square continuous and isotropic
Gaussian random field, with angular power spectrum that satisfies Condition A.
As $j_1 \to \infty$, we have

$$\frac{I_{j_1 j_2 j_3}}{\sqrt{E I^2_{j_1 j_2 j_3}}} \to_d N(0, 1).$$


Proof. In view of the results in [32], to establish a Central Limit Theorem for a
multilinear form in Gaussian random variables, it is enough to investigate the
asymptotic behaviour of fourth order moments (or equivalently cumulants), see
also [14]. Our aim is then to show that, as $j_1 \to \infty$,

For notational simplicity, we shall write 

By the diagram formula we have, for all index sets /: 

4nri4.i|= E n s^^P3.ik^'4\ 

Ue/;=i J 7er(/,3) {(ij)(*','')}e7 
Similarly to [23], we define 

P(7;.7i,j2,j3) = E n s^i'p„{kf>,k^'h 

ii JUsJaV^l I ^^2 I '^3 )"'3i3233 \'^l i '^2 ^ '^3 ) 

SO that 



^^ji32j3 = E '=1 ^33k3 ^313233 {^1 , , k:i)hj^j^j^ [ki , A:2 , fcs) I 

['4 3 

= E ' ' 'E''^ I nn''^Ji'=i^JiJ2j3(^l ; ^2 '^3 )^Jij2j3(^l '^2 '^3 ) 



,(4) yi=ii=i 



I E + E 1^(7; 



where $\Gamma_C$ is the set of all connected diagrams. To conclude our argument, we
only need to show that

E Pir.ji.h.h) = 0{B-'''^), as ji oo, (18) 

7Grc(4,3) 
and

E P(7;J1,J2,J3)= E p(t,JuJ2,J3) = 3(EII,,,,?. (19) 
7e%(4,3) 7erp(4,3) 

(19) is an immediate consequence of the definition of $E I^2_{j_1 j_2 j_3}$ and trivial
combinatorial manipulations (see [34]). The result in (18) is proved by splitting
connected diagrams into those with or without flat edges. Diagrams with flat edges
are dealt with in Lemma 3, while those without are dealt with in Lemma 4. We
stress that, contrary to what is often the case when the diagram formula
is applied, terms with flat edges do not vanish, due to correlation among different
locations in the spherical needlets coefficients. This completes the proof of
the main result. □

Lemma 3. For a connected diagram with flat edges, $\gamma \in \Gamma_{CF}(4, 3)$, we have

P(7;ji,j2, is) = 0{B^^^^), as ji 00. 

Proof. We write $\{(k_b^{(a)}, j_b)\}_{a=1,\dots,4, \ b=1,2,3}$ for the elements in our diagram, $a$
and $b$ denoting the row and column indexes, respectively. We recall also that
$k_2 = k_2(k_3)$, $k_1 = k_1(k_3)$, as explained earlier. For Case 1), i.e. $j_1 < j_2 - 1 <
j_3 - 2$, since $E\{\beta_{j_1 k_1} \beta_{j_2 k_2}\} = 0$ for every $k_2 \in X_{j_2}$, $k_1 \in X_{j_1}$, it is easy to see that
$\rho(\gamma; j_1, j_2, j_3) = 0$. For Case 2), i.e. $j_1 + 1 < j_2 = j_3$, we assume (without loss of
generality) that a flat edge is present in the first row of the diagram, i.e. we let
$\{(k_2^{(1)}, j_2)(k_3^{(1)}, j_3)\} \in \gamma$. By the same argument as in (7) we obtain immediately
B^n~^n n E n '5^;>.(fcf',^f^) 

ie{2,..,4}fc(>)^^l{(j,;)(»',;')}G7, 
.(1) L(2)^ 



y — V ■'^^^ = 



On the other hand, under $j_1 = j_2 < j_3 - 1$, again we assume a flat edge
$\{(k_1^{(1)}, j_1), (k_2^{(1)}, j_2)\} \in \gamma$; by necessity, there should exist another flat edge in the
graph, and w.l.o.g. we take it to be in the fourth row, i.e. $\{(k_1^{(4)}, j_1), (k_2^{(4)}, j_2)\} \in \gamma$.
Then we have ^ . , (o = ^ . , (o , and the resulting term can be bounded by 

n Ek.3(fcr\^f)P.3(fcr,4'^)|p,\(fcP\^p^) 

E ^ ) E ^^3 i^3 ^ ^3 




B4J3 



X Cm 
C



Finally for Case 3), the argument is analogous; more precisely, components with
diagrams in $\Gamma_{CF}(4,3)$ can be bounded by



iei fc(») 



fe(2),fc(3) IfeCl) fe(4) J 

where in the third equation we used (9). Thus the proof is completed. □

Lemma 4. For a connected diagram without a flat edge, $\gamma \in \Gamma_{C\bar{F}}(4,3)$, we have

p{i\:h,:h.h) ^0{B~^^/'^), as ji ~^oo. 

Proof. Connected diagrams without flat edges and with four nodes can be partitioned
into two classes: so-called cliques, where each vertex is connected to
all three others, and terms with loops of order 2. We focus on the former class;
without loss of generality, we can express $\rho(\gamma; j_1, j_2, j_3)$ by

n E (k['\k^^%, ik?\k['^)p,. (fc^^ ) 

^ Pis (^3 ''^3 '')Pi3(^3 '''^3 '') 
4 

xTTa- ■ ■ (k^^^ k'^^ k'-^hh ■ ■ (k^^^ k'-^^ k'-^h 



By means of (9) we readily obtain the bound 

4 



c 



1 



1 



(l + i3^id(fc(^\fcf'))*^ {l + Bndik^{'\k[^'>))^ 



(1 + B^^d{k^^\k'i^)Y' (1 + BJ2d(fc(2)^ ^(4)))M 



1 



1 



(2) 1,(3) NU/' 



(20) 
Consider first Case 2), with $j_1 + 1 < j_2 = j_3$. Using inequality (11), (20) can be
replaced by

^4j2-4ji I I 

t^,^,^^^ (l + i?^-^d(fc(i\fcf ))^^(l + B^-^d(fcf\fcf ))M 

1 1 



(1 + Bi^d{ki^\4^^))'^'' (1 + Br^d{k'^^\kf^)Yi 



"•2 ' 2 ^^2 



(1 + BJ2d(A:^2\ /fc(4)))M (1 + Bnd{k'^^\k^^'^))M 



< 



K 1,(2) V-^ r ^ "^,"'2 ' '^2 

c ^ ( c 



M 



B'^\,t.^{l + Bndik^^\ki'^W^' 



Likewise, for $j_1 = j_2 < j_3 - 1$, we have the bound
C A'sr^ 1 1 



HE 



B^'' t\ (1 + Bnd{k^^\k^^^)Y' (1 + Bnd{kf\kf^)Y' 

1 1 1 



(1 + Bnd{k'^^\kf^)Y^ (1 + Bnd[kf\kf'^)Y' (1 + Br^d{k'i\kf^))^ 
1 

^ 

(l + BJ3d(fcf\fcf)))M 



B''' t=i (1 + Bi^d{k^^\k^^^)Y' (1 + i?Jid(fcf \ fcf )))M 



(1 + SJ2d(fcJ^\ fcp^))^^ (1 + B3^d{k^^\kf^)Y^ 



E 
E 



1 



^1, (l + SJ3d(fc^'^/fc^''M)M 



;.(4) 

1 



(1 + S^3d(fcf ))M 



< 






E E 



X 



^* ti^X;-' + -Bj.<i(t!'>, tf >))«' (1 + B»<i(t!'>,tp>))" 



■fe 



(■1) 



(1 + B^^d{k[^\k['^'>)Y' (1 + BJ'2d(fcp\ fcf '))*^ 



This concludes the proof for Case 2); the proof for Case 3) could be implemented 
along identical lines. 
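The bounds above rely repeatedly on the fact that sums of the localisation factor $(1 + B^j d(k, k'))^{-M}$ over cubature points stay bounded as $j$ grows. The following sketch illustrates this numerically under simplifying assumptions that are not in the paper: the cubature points are replaced by a planar square lattice of spacing $B^{-j}$ on $[-1,1]^2$, $d$ is the Euclidean distance from the origin, and $B = 2$, $M = 4$ are arbitrary illustrative choices. The function name `localisation_sum` is hypothetical.

```python
import math


def localisation_sum(j, B=2.0, M=4):
    """Sum of (1 + B^j d(0, k))^(-M) over a planar lattice of spacing B^(-j)."""
    n = int(round(B ** j))  # number of lattice steps per unit length
    total = 0.0
    for a in range(-n, n + 1):
        for b in range(-n, n + 1):
            d = math.hypot(a, b) / n  # Euclidean distance from the origin
            total += (1.0 + (B ** j) * d) ** (-M)
    return total


sums = [localisation_sum(j) for j in range(1, 7)]
for j, value in enumerate(sums, start=1):
    print(j, value)
```

In this toy setting the number of lattice points grows like $B^{2j}$, while the sum increases only marginally with $j$, which is the mechanism exploited when summing out one index at a time.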

The analysis for Case 1), $j_1 + 1 < j_2 < j_3 - 1$, i.e. the case where all three
frequencies differ, requires considerably more care. As before, let $V_j(x_i, r)$
be the Voronoi cell associated to the cubature point $x_i \in X_j$; we recall that it satisfies
$B(x_i, r/2) \subset V_j(x_i) \subset B(x_i, r)$. Our idea is to partition (20) into four elements,
as follows:

(20) < ;^nE{Wi'\fci'^ >3r),d(A:f),fcf) >3r)) 

+ l{d{k[^\k^^^ > 3r), d(fcf \ fcf ' < 3r)) 

+ I{d{k[^\kf^ < 3r), d(fcf \ fcf ' > 3r)) 

+ I{d{k^^\k^^^ < 3r), fcf ^ < 3r))} 

1 1 1 

^ 

1 1 1 



(1 + BJ2rf(fc(2)^/j{4)))M (1 + SJ3d(fcf\4'*)))A^ (1 + BJ3d(fc(2)^yt(3)))M 

The first three summands are easy to bound, indeed it is enough to notice that 

4 



c 

1=1 k: 



l[Y.{id{k['\k['^ >3r),d{kr\kr' >3r))} 



[1 + Bnd{kf\kf'^)Y' {l + Bi^d{kf\k^^^))^i [1 + Br^d{k''^\kf^)Yi 
1 1 1 



(f + B3'^d{k!^^\k^^'^)y'' (1 + BJ3d(fcW, fcW))M (1 + sj3d(fcf \ fcf ^))^^ 



(l + BJ3d(fc(2)^fc(3)))M 
because



< TB^JS r < r D(M-2)(iji-j3) 

- X S^^^30Af < ^B 

by taking $r = B^{-j_1/2}$, compare Lemma 10 in [6]. The argument for the second
and third terms is analogous. Concerning the last summand, we recall that

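$$\mathrm{Card}\{k' \in V_{j_1/2}(k) \cap X_{j_3}\} \approx B^{2(j_3 - j_1/2)} \tag{21}$$

for every $k \in X_{j_1/2}$. Now denote

$$\Omega(k; j_3) := \{(k_1, k_2, k_3, k_4) : k_1, k_2, k_3, k_4 \in X_{j_3},\ k_1 \in V_{j_1/2}(k),\ d(k_1, k_4) < 3r,\ d(k_2, k_3) < 3r\},$$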

where $r = \kappa B^{-j_1/2}$. Heuristically, the idea is to split $\Omega(k)$ into regions where
$(k_1, k_2, k_3, k_4)$ are each "close" to all three others, and regions where they are
close two by two but the two pairs need not belong to the same neighbourhood.
More precisely, we take $\Omega(k; j_3) \subset A_1 \cup A_2$, where

$$A_1 = \{(k_1, k_2, k_3, k_4) : k_1, \dots, k_4 \in B_{6r}(k),\ k_1 \in V_{j_1/2}(k)\}$$

and

$$A_2 = \{(k_1, k_2, k_3, k_4) : k_1 \in V_{j_1/2}(k),\ k_2 \in S^2 \setminus B_{6r}(k)\}.$$

We can hence define

$$\Omega_1(k; j_3) := \Omega(k; j_3) \cap A_1, \qquad \Omega_2(k; j_3) := \Omega(k; j_3) \cap A_2.$$

In the region $\Omega_1(k; j_3)$, the idea is to keep $k$ fixed and then proceed by evaluating
the cardinality of $B_{6r}(k)$; in view of (21), this leads to

(20) ^ B^ ^ , S.o ^ (1 + Bi^dik,, k,))M (1 + i3.3d(fc2, fca))^ 

/ R4(i3-Jl/2)+ii \ 
by (9) and Lemma 4.8 in [31]. On the other hand, in the region $\Omega_2(k)$, we
exploit the fact that $d(k_1, k_2), d(k_3, k_4) > 3r = 3\kappa B^{-j_1/2}$, so that we obtain the
upper bound (for some $C > 0$)

1 1 1 

{l + B^^d{k^r^^\k!^^^))M {1 + Br^d{kl^\k^^^)Y' (1 + BJ3d(fcf\/!cf)))M 

^ ^ ^ 1 1 

- 54j3 (l + C(Bi^-3^/^))'^^^ (1 + Bi^d(ki,k4)W 

1 

= o[ )=o(i?---)). 

The proof for the remaining terms is very similar and hence omitted for brevity's 
sake. □ 



3.2. Unknown angular power spectrum 

As the final result of this Section, we wish to focus on the case where the variance
of the needlets coefficients is unknown and estimated from the data. A natural
estimator for $\sigma_j^2$ is provided by

where as before $N_j = \mathrm{card}\{X_j\} \approx B^{2j}$. We define our studentized statistics as
$\hat\beta_{jk} := \beta_{jk}/\hat\sigma_j$ and we then consider

$$\hat I_{j_1 j_2 j_3} = \sum_{k_3} \hat\beta_{j_1 k_1} \hat\beta_{j_2 k_2} \hat\beta_{j_3 k_3}\, \delta_{j_1 j_2 j_3}(k_1, k_2, k_3)\, h_{j_1 j_2 j_3}(k_1, k_2, k_3).$$

Our next result shows that this studentization procedure has no effect on asymp- 
totic behaviour. 
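The effect of studentization can be illustrated with a toy simulation. This is a sketch under assumptions that are not taken from the paper: the coefficients are replaced by independent zero-mean Gaussian draws (actual needlet coefficients are correlated across locations), the variance estimator is taken to be the sample second moment over the $N_j = B^{2j}$ points, and the values of `B`, `j` and `sigma` are arbitrary. The helper name `studentize` is hypothetical.

```python
import math
import random


def studentize(coeffs):
    """Divide coefficients by the square root of their sample second moment."""
    sigma_hat_sq = sum(b * b for b in coeffs) / len(coeffs)
    sigma_hat = math.sqrt(sigma_hat_sq)
    return [b / sigma_hat for b in coeffs], sigma_hat_sq


rng = random.Random(0)       # fixed seed so the run is reproducible
B, j = 2, 8                  # illustrative bandwidth parameter and frequency
N_j = B ** (2 * j)           # number of toy "cubature points"
sigma = 3.7                  # true standard deviation, unknown to the estimator
beta = [rng.gauss(0.0, sigma) for _ in range(N_j)]

beta_hat, sigma_hat_sq = studentize(beta)
ratio = sigma_hat_sq / sigma ** 2
print(N_j, ratio)
```

In this independent setting the ratio of estimated to true variance fluctuates around 1 with standard deviation of order $N_j^{-1/2}$, which is the kind of consistency that a Slutsky-type argument needs.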

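Theorem 5. As $j_1 \to \infty$, we have $\{\hat\sigma^2_{j_1} \hat\sigma^2_{j_2} \hat\sigma^2_{j_3}\} / \{\sigma^2_{j_1} \sigma^2_{j_2} \sigma^2_{j_3}\} \to 1$ in probability,
and hence

$$\hat I_{j_1 j_2 j_3} \to_d N(0,1).$$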

Proof. By a standard application of Slutsky's theorem, the weak convergence
result follows immediately from the consistency of the variance estimator. We
provide a proof for the three cases separately.
For case 1), i.e. $j_1 < j_2 < j_3$, we have immediately
$E[\{\hat\sigma^2_{j_1} \hat\sigma^2_{j_2} \hat\sigma^2_{j_3}\} / \{\sigma^2_{j_1} \sigma^2_{j_2} \sigma^2_{j_3}\}] = 1$. Moreover



ji 32 j3 ,=i^^_,^,4^.^^,eAr,, 



where 



Cjii-i .Cjjfcj e-^ji VfjifciS'^ji / 



Hence 



m m ' 

11 J2 J3 



x(A^ + o(iv,3))444-44-:' 



33 



o 



'^31 "^3 2 '^3 3 



O 



For case 2), we focus on the case where $j_1 = j_2 < j_3$; the remaining part of the
proof is nearly identical. First note that



Likewise 



E 



E 



2(47r)^ 
Nf m

31 12 



E 



Cjifci e-^ji ^ijiki&Xj-^ / 



4 



.4 \ x2 



By (9) and [31] (Lemma 4.8), we have 



< 



4n E 



(1 + B^^d{k\,k\)f (1 + fc3))3 (1 + Bnd{k\,kl)f 



< a 



E 



Hence the variance is bounded by 

.4 \ \ 2 



1 1 



1 



The proof for case 3) is an easy consequence of results by [6, 7].



□ 



4. Convergence to multiparameter Gaussian processes 

Our aim in this Section is to extend the previous results to functional convergence
theorems. The motivation for such an extension can be easily explained.
Indeed, from the point of view of applications, practitioners are typically interested
not only in the possible existence of non-Gaussianity and/or other features,
but also in their location in frequency space. If we focus for instance
on cosmological applications, which are the main driving rationale behind our
work, it is important to recall that the existence of possible non-Gaussianities
takes a very different meaning according to the scales where they are located,
so that a suitable statistical procedure should provide information not only on
their existence, but also on their position in the frequency domain. As an example,
a huge debate has arisen in the cosmological literature on the possible
existence of a non-Gaussian "Cold Spot" in CMB data, much of the related literature
concerning the determination of the angular scale of such features, see
for instance [12, 13]. Concerning this feature, it may even be of interest to test
for Gaussianity only on a subspace of the sphere (this is indeed what happens
in practice, because of missing observations). The modification of (13) under
these circumstances is straightforward: we would simply restrict our sum to a
subset of the cubature points. Our following discussion would be asymptotically
unaltered.

In [11, 23], it was proposed to build alternative forms of partial sum processes
from the bispectrum $B_{l_1 l_2 l_3}$, and to use them as a probe of non-Gaussianity at
various scales. All different proposals were univariate, in the following sense. Assume
the resolution of the experiment is such that frequencies $l = 1, \dots, L$
are observed; the partial sums were then run only on a subset of configurations
with cardinality $L$, whereas the number of multipole combinations $(l_1, l_2, l_3)$
which would be available for statistical analysis is of the order of $L^3$. One of
the reasons for this restriction had to do with computational complexity: the
evaluation of even a single bispectrum statistic is extremely time consuming,
so that the exploration of all possible configurations is likely to be unfeasible
even on the greatest supercomputing facilities. On the contrary, needlets are
extremely convenient from a computational point of view, and there is no obstacle
in considering larger frequency configurations, provided of course that the
triangle conditions are satisfied.
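The cubic growth in the number of admissible multipole configurations can be checked directly. The sketch below counts ordered triples $l_1 \le l_2 \le l_3 \le L$ that satisfy the triangle condition $l_3 \le l_1 + l_2$ and have even sum (odd sums are excluded because the Wigner coefficient with all $m$'s equal to zero then vanishes). The restriction to ordered triples and the function name `count_multipole_triples` are choices made here for illustration only.

```python
def count_multipole_triples(L):
    """Count l1 <= l2 <= l3 <= L with l3 <= l1 + l2 and l1 + l2 + l3 even."""
    count = 0
    for l1 in range(1, L + 1):
        for l2 in range(l1, L + 1):
            # The triangle condition caps l3 at l1 + l2.
            for l3 in range(l2, min(L, l1 + l2) + 1):
                if (l1 + l2 + l3) % 2 == 0:
                    count += 1
    return count


for L in (10, 20, 40, 80):
    print(L, count_multipole_triples(L))
```

Doubling $L$ multiplies the count by a factor that approaches $2^3 = 8$ as $L$ grows, in contrast with the $L$ configurations used by the univariate proposals.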

We shall hence focus on two possible partial sums processes, which correspond 
broadly to cases 1 and 2 of the previous section; more precisely 




$$J_{1L}(r_1, r_2) := \frac{1}{L} \sum_{j_1=1}^{[Lr_1]} \sum_{m=0}^{[Lr_2]} I_{j_1,\, j_1+K+m,\, j_1+K+m}, \tag{22}$$




[Lr] Ni W(mi) 



(23) 



where 
-ftT > is some integer satisfying the constraint determined in Lemma 1,
Ni = max {to ; 1 + > 5^^'^'+"} 

N{x) := max {x : log^ (l + - 2K - rm > 0} . 

Theorem 6. a) As $L \to \infty$,

$$J_{1L}(r_1, r_2) \Rightarrow W(r_1, r_2), \quad \text{in } D[0,1]^2, \tag{24}$$

where $W(\cdot)$ is the two-dimensional Brownian sheet, i.e. the zero mean Gaussian
process with covariance function $E W(r_1, r_2) W(s_1, s_2) = (r_1 \wedge s_1)(r_2 \wedge s_2)$.
b) As $L \to \infty$,

$$J_{2L}(r) \Rightarrow W(r), \quad \text{in } D[0,1], \tag{25}$$

where $W(\cdot)$ is standard Brownian motion.

Here, $\Rightarrow$ denotes weak convergence in the sense of [9], and $D[0,1]^p$, $p \in \mathbb{N}$, is the
usual multidimensional Skorohod space.
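The Brownian sheet covariance in part a) can be anticipated from a simple count. The sketch below assumes, purely for illustration, that the partial sum process is a double sum over $j_1 = 1, \dots, [Lr_1]$ and $m = 0, \dots, [Lr_2]$ normalised by $1/L$, and that the summands are uncorrelated with unit variance; the covariance is then the number of shared index pairs divided by $L^2$. The function names are hypothetical.

```python
import math


def partial_sum_covariance(L, r, s):
    """Covariance of two normalised double partial sums with uncorrelated unit-variance terms."""
    n1 = min(math.floor(L * r[0]), math.floor(L * s[0]))      # shared j1 = 1..n1
    n2 = min(math.floor(L * r[1]), math.floor(L * s[1])) + 1  # shared m = 0..(n2 - 1)
    return n1 * n2 / L ** 2


def brownian_sheet_covariance(r, s):
    """Covariance (r1 ^ s1)(r2 ^ s2) of the two-dimensional Brownian sheet."""
    return min(r[0], s[0]) * min(r[1], s[1])


r, s = (0.5, 0.8), (0.7, 0.3)
for L in (10, 100, 1000):
    print(L, partial_sum_covariance(L, r, s), brownian_sheet_covariance(r, s))
```

As $L$ grows, the finite-sample covariance approaches the limiting value, with an error of order $1/L$ coming from the integer parts and the extra $m = 0$ term.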

Proof. We start from (24); as usual, we need to establish convergence of the 
finite-dimensional distributions and tightness. Obviously, 

$$E J_{1L}(r_1, r_2) = 0,$$

and because ^^[iji j2^; j^] = 
EJiL{ri,r2)JiL{si,S2) 

[Lri] [Lr2] [Lsi] [L.S2] 



Jjl / J / J / J / J ^^ji,ji+K+m,ji+K+mJ-ji,ji+K+m',ji+K+m' 



ji=l m=Q = l m'=0 
^ [Lri]A[Lsi] [Lr2]A[Ls2] 

^ ]^ X! X! ^^ji,n+K+ni,ji+K+m 

m— 

lLr2] A [LS2] 
= J2 ^ ^''2 A S2J [ri A sij . 

ji=i 

To establish Gaussianity, we can again rely on the results by [-i-'] and proceed 
with the bounds for the fourth-order cumulants. As the computations are very 
much the same as in the previous Section, we omit the details for brevity's sake. 
To consider tightness, we use the classical criteria given for instance in [35]. 
Define first the two-dimensional increments

$$J_{1L}((s_1, r_1] \times (s_2, r_2]) := J_{1L}(r_1, r_2) - J_{1L}(r_1, s_2) - J_{1L}(s_1, r_2) + J_{1L}(s_1, s_2).$$

It is again a standard computation to show that, as $L \to \infty$,

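$$E\left[J_{1L}((s_1, r_1] \times (s_2, r_2])\right]^2 = \frac{1}{L^2} \sum_{j_1=[Ls_1]+1}^{[Lr_1]} \sum_{m=[Ls_2]+1}^{[Lr_2]} E I^2_{j_1,\, j_1+K+m,\, j_1+K+m} = \frac{1}{L^2} \{[Lr_1] - [Ls_1]\}\{[Lr_2] - [Ls_2]\} \le 4 \{r_1 - s_1\}\{r_2 - s_2\}.$$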

We can then establish tightness by showing that

1) $s_i < t_i < r_i$, $i = 1, 2$

^ / [Lti] [Lt2] [Lt2] _^ ^ \ 

^ E E E hihhij[j'2j'2] 

^jlJ[ = [Lsi] j2=jl+K+m, j'^=j[+K+m' , / 
m=[Ls2] + l m' = [Ls2] + l 

/ [iri] [Lr2] [Lr2] \ 

X E E E innnij[ji2J^ 

Vii,iJ=[iti] j2=ii+i<'+m, j^=jj+/c+„i', / 

m=[Lt2\ + l m' = [Lt2\ + l 

^ / [iti] [Lt2\ \ 

E E ^^ji,3i+K+rn,]i + K+rn\ 
\ ji = [Lsi\m=[Ls2\ / 

/ [i'-i] [Lt2\ \ 

x( E E ^^j[j[+K+Tn',][+K+r,i' j 

\j[ = [Lti]rn' = [Lt2] ' 

$$\le 16 (r_1 - t_1)(r_2 - t_2)(t_1 - s_1)(t_2 - s_2) \le 4 (r_1 - s_1)^2 (r_2 - s_2)^2.$$

2) $s_1 < r_1$, $s_2 < t_2 < r_2$,

-B[JiL((si,ri] X {s2,t2])JlL{isi,ri] X (t2,f2])]^ 

^ / [Lti] [Lt2] [Lt2\ ^ \ 

E E E hlj2j2^3[3'23'2] 

m=[Ls2] + l m' = [Ls2] + l 
/ [Lri] [Lr2] [Lra] \ 

X ( E E E ^3l32j2lj[j'2j'2 1 

\jlj'i = lLsi] j2=jl+K+m, j'^=j[+K+m' , / 
m=[Lt2] + l m' = [Lt2] + l 

^ / [-f-'-l] [it2] \ 

^X^l E E -^-^ijl+^f+m.il+^f+m ) 
V Jl = [isi] m=[Ls2] / 

[Lri] [Lr2] 

X 



E E ^^.i- 

\i;=[Lsil m' = \Lt2] 



[+K+m',j[+K+m' 

Kj[ = [Lsi] m' = [Lt2] 
^ [Lri] [Lt2] [Lr2]

+ EE 

j<*' = [Lsi] + l, m<*' = [Ls2]; m<*' = [Lt2]; 
i=l ...4 i=l,2 i=3,4 



7erc \ i=l 



For the first part it is easy to see that it is bounded by $16 (r_1 - s_1)^2 (r_2 - t_2)(t_2 - s_2)$;
for the second part, for $j_1^{(i)}, j_2^{(i)}$ in each of their domains, we have



4 

Ji J2 ^2 



J2 



76r2c *=i 

where $\Gamma_{1C}$ denotes the graphs with cliques (all nodes connected with all others),
whereas $\Gamma_{2C}$ refers to graphs with loops of order two; these two disjoint classes
cover all possible connected graphs with four nodes. We have

^ [Lri] [Lt2] [Lr2] , 4 ^ 

E E E E ^(^^n^'-'jf'+K+rnWjW+K+mW 



j;''=[Lsi] + l, m''' = [-Ls2]: m(') = [Lt2]: 7e{/''S2} 
'i=l,...4 i=l,2 1=3,4 

[Lri] \Lt2\ {Lr2\ 

E E E l<C'(ri-.if(r2-t2)(t2-S2) 



L4 

Ji,j; = [i«il + 1 m<*' = [Ls2]; m<*' = [Lt2]; 

i=l,2 i=3,4 

from which we obtain 

E\Jxl{{sx,ti\ X (s2,i2])JiL((si,ri] x (i2,r2])]' < C(ri 

Similarly, we can get the same result for
3) $s_1 < t_1 < r_1$, $s_2 < r_2$, that is

$$E\left[J_{1L}((s_1, t_1] \times (s_2, r_2])\, J_{1L}((t_1, r_1] \times (s_2, r_2])\right]^2 \le C (r_2 - s_2)^2 (r_1 - t_1)(t_1 - s_1).$$

This concludes the proof of (24). 

For (25), we start again from the convergence of the finite-dimensional distributions;
for notational simplicity, we stick to the univariate case. It is obvious
that $E J_{2L}(r) = 0$; on the other hand,

EJ2L{r)J2L{s) 

[Lr] [Ls] ( Ni N{mi)N{m[) 

- EE E E E 

E-^jl Ji + A'+mi J I +2 A'+mi +ni2 d'^ +K+m[ ,j J +2if +m'j 

^ [Lr]A[Ls] Ni N{mi) 

~ r sr^Ni 7777 \ , 1 \ $Z $Z $Z ^^jidi+K+mi,ji+2K+mi+m2 

= Y {[Lr] A [Ls] - 1) ^ [r As], if min{r, s} > 0. 

For Gaussianity, we analyze once again fourth-order cumulants, i.e. the con- 
nected components in the expansion of the fourth moment. As before, we need 
only focus on connected diagrams with four nodes, which can be partitioned into 
two classes: the cliques, where all nodes are connected with all three others, and 
diagrams with a loop of order 2. As before, these terms can be bounded by 



1=1 

because for instance 



E 



1 1 1 



(1 + d{ki^^ , fcf ^ ))^^ (1 + B^'^'^ d{ki^^ , 4^^))^ (1 + bA'^ d{kf^ , k^^^))^ 
1 



{l + Bi'^''d{kf\kf'^)Y^ 



nit (1 + B-^'^U{k'i\k'i^)Yi 



X 

k. 



E 



'3 
To sum up



4 f [Lr] Ni 

n E E E 



[Lr 



< C[Lr]iJ2{N{m^) + l)\ J2 B^^^/' ^ 0{L). 

\mi=0 / = l 



It follows easily that 



EJtir) 



3{El 



jl -jl +K+mi ,ji+2K+mi +m2 



[Lr] Ni N(mi} 

E E E 

4 f [Lr] N, ^('"5")' 

+n E E E > 



y 



^ME™Uo(^(-i) + i) 



= 3|i?Ji(r)|^ + 0(^-j, 

which is enough to conclude the proof for the finite-dimensional distributions, 
in view of the standard argument from [32] that we used before. 

To conclude the proof, we need only to consider tightness in D([0, 1]). Note 
that j2 jg/jj j/] = ^jl^jl^jl^ the variance of J2L in [s,r] is provided by 

[Lr] Ni Af(mi)|of2 

EiJ..M-j..(.)p= E E E ry^w^^Trr ^^^^-^^ 

Now we establish our tightness criterion. For any $0 < s < t < r < 1$,

E|J2L(r) - J2L{t)\''\J2L{t) - J2l(s)|' 

[Lr] JVi W(mi) |n,rt 
_ ^-'ji Ji+if+ini,ji+2i<'+mi+in2 



ji = [Lt] mi=0 m2=0 

[Li] TVi JV(m;) gj2 



[Lr] 



[Lt] 



4 iVi Af(mi") 



E E n E E 



j<'' = [Lt],j<'' = [Ls], 
i=l 2 z=3,4 



m<''=0 m!,''=0 



X 
. . P('y',Y\f—T I -(') I I (') -WiTr^i (O I (o)

(iE™Uo(^(™i) + i))' 

< ^{[Lr]-[Lm[Lt]-[Ls])<4ir-sr. 
This completes the proof of tightness. □
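The last step of the tightness bound compares a product of integer-part increments with $4(r - s)^2$. A quick exhaustive check of that elementary inequality is sketched below; it assumes the normalisation is $1/L^2$ and uses exact rational arithmetic on a grid of points, with `increment_product` and `worst_ratio` being hypothetical helper names. The inequality holds because either $r - s < 1/L$, in which case one of the two increments is zero, or each increment is at most $2L(r - s)$.

```python
from fractions import Fraction
from itertools import combinations
from math import floor


def increment_product(L, s, t, r):
    """([Lr] - [Lt]) ([Lt] - [Ls]) / L^2 for s < t < r."""
    return Fraction((floor(L * r) - floor(L * t)) * (floor(L * t) - floor(L * s)), L * L)


def worst_ratio(L, grid=40):
    """Largest value of increment_product / (4 (r - s)^2) over a rational grid in [0, 1]."""
    points = [Fraction(i, grid) for i in range(grid + 1)]
    return max(
        increment_product(L, s, t, r) / (4 * (r - s) ** 2)
        for s, t, r in combinations(points, 3)  # combinations keeps s < t < r
    )


for L in (1, 3, 7, 10, 25, 100):
    print(L, float(worst_ratio(L)))
```

A worst-case ratio no larger than 1 for every tested $L$ is consistent with the moment condition used to conclude tightness.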

5. Behaviour under non-Gaussianity 

In this final Section, we shall provide some quick and informal discussion on
the behaviour of our statistics under non-Gaussianity; see [33, 26] for other
applications of the needlets to cosmological data analysis. There exists of course
a huge variety of non-Gaussian models for spherical random fields, and we shall
defer a much more detailed treatment to future work. Our purpose here is
different, i.e. we want to provide some heuristic discussion on the expected
behaviour of our procedures for physically motivated non-Gaussian models. This
will provide some guidance to practitioners for applications to CMB data, which
are currently under way in a separate work.

We start from the expected value of the needlets bispectrum, which is pro- 
vided by 



¥J- ■ ■ 



-7== ^/^JikiP3^k^(^j3kJlil2l3(kl,k2,k3) 

felfc2fc3 



VWm \B'-J\B" 



■'^ li,l2,l3=Bi-^ mim.2m3 ^ ^ ^ ^ ^ 



X 



X 



mi 1712 "I3 yl^OOOyV 47r 

2^7 Yi^rrii i£,jiki)Yl2m2i^j2k2)Yl3m3i^j3k3)Sj-i^j2j3{kl, fc2, ^3) 



\ kik2k3 ) 

Here, we recall that $b_{l_1 l_2 l_3}$ is the so-called reduced bispectrum (see for instance
[20, 23]), which collects the non-Gaussian component in the third order moment
$E(a_{l_1 m_1} a_{l_2 m_2} a_{l_3 m_3})$, i.e. by definition

$$E(a_{l_1 m_1} a_{l_2 m_2} a_{l_3 m_3}) = \begin{pmatrix} l_1 & l_2 & l_3 \\ m_1 & m_2 & m_3 \end{pmatrix} \begin{pmatrix} l_1 & l_2 & l_3 \\ 0 & 0 & 0 \end{pmatrix} \sqrt{\frac{(2 l_1 + 1)(2 l_2 + 1)(2 l_3 + 1)}{4\pi}}\; b_{l_1 l_2 l_3}.$$

In the cosmological literature, a very popular model for $b_{l_1 l_2 l_3}$ is provided by
the so-called Sachs-Wolfe bispectrum ([-!'], equation (21)), which yields

$$b_{l_1 l_2 l_3} = -6 f_{NL} \{C_{l_1} C_{l_2} + C_{l_1} C_{l_3} + C_{l_2} C_{l_3}\},$$

where $f_{NL}$ is a physical constant (see for instance [8, 37]). The Wigner coefficients
on the right hand side ensure that the expected value $E(a_{l_1 m_1} a_{l_2 m_2} a_{l_3 m_3})$
is rotationally invariant under a change of coordinates, an obvious consequence
of the isotropy of the random field. For our purposes below, it is sufficient to
recall that


I I l\ (-l)-3//2 / I i + (-l)-'o+ 



oooy I '^00 J Vi ' 
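For readers who wish to evaluate the Wigner coefficients with all $m$'s equal to zero numerically, the sketch below uses the standard closed form for these coefficients; it is offered as a generic illustration and is not claimed to be the exact expression recalled in the text. The function name `wigner_3j_zero_m` is hypothetical, and the plain factorial arithmetic is only suitable for moderate multipoles.

```python
from math import factorial, sqrt


def wigner_3j_zero_m(l1, l2, l3):
    """Wigner 3j coefficient (l1 l2 l3; 0 0 0) from the standard closed form."""
    total = l1 + l2 + l3
    if total % 2 == 1:
        return 0.0  # vanishes for odd l1 + l2 + l3
    if l3 > l1 + l2 or l3 < abs(l1 - l2):
        return 0.0  # triangle conditions violated
    g = total // 2
    delta = (
        factorial(total - 2 * l1)
        * factorial(total - 2 * l2)
        * factorial(total - 2 * l3)
        / factorial(total + 1)
    )
    magnitude = sqrt(delta) * factorial(g) / (
        factorial(g - l1) * factorial(g - l2) * factorial(g - l3)
    )
    return (-1) ** g * magnitude


print(wigner_3j_zero_m(1, 1, 0), wigner_3j_zero_m(2, 2, 2))
```

The two printed values correspond to the tabulated coefficients $-1/\sqrt{3}$ and $-\sqrt{2/35}$.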

Now exploiting again the cubature formula (3) we obtain as before 

\ kik2k3 > 



(26) 



Yl,„^A0Yl2rn2mi3msi0d( 

§2 



^1 h k \ fli h l3\ /(2;i + l)(2Z2 + l)(2/3 + l) 



whence 



mi 1712 / \000/V 47r 



V J \B^^ J \ / " ' V 



(2^i-H)(2/2 + l)(2^3 + l) Y '''' 

in\Ta2ri\3 ^ 

B^^ J h\J h\J h\, (h h ^3' ' 



■'^ ■'^ ■'^ ;i,/2./3=si-i \ / \ / \ / \ 

(2;i-f i)(2Z2 + i)(2;3 + i) 



47r 

in view of the orthonormality properties of the Wigner 3j coefficients. To keep
the analogy with the cosmological literature, we shall focus on "equilateral" and
"squeezed" configurations, see [3, 23]. In the equilateral case $j_1 = j_2 = j_3 = j$
we have



J \Bi J \Bi 



h h h V {2h + l){2l2 + l){2k + l) 



y An 
Now recall / w cr^ w C%^B^i, bi.i^^ w fNLli°'l2°' so that, using also (17) 

(2Zi + l)(2Z2 + l)(2l3 + l) 



X 



An 

o /; \ /7 \ / 1 \ r 7 — Q7 — Q / J J 1 \ 



This suggests the expected value of the needlets bispectrum can either diverge
or converge to zero, according to the asymptotic behaviour of the angular power
spectrum; in particular, it does diverge for all $\alpha < 4$. On the other hand, for
$j_1 \ll j_2 = j_3$, by an analogous argument we obtain

VT ~ /^Li?^- 

'^^313232 - Q-j^{a/2-l)B-32{a-2) 

JnlB^'- 



X 

ll,l2-l3=B3-^ 

InlB-^^'/^B^^ 



y bi -^]bi ^]bi ^]b-^'''b-'''''b'^ 

^ \ B31 \ W2 \ R32 



As for the usual bispectrum, the previous computations suggest that the power
is maximized by "squeezing" frequencies, i.e. maximizing the differences between
the "side lengths" $j_1$ and $j_2$. This is the same sort of qualitative result which
was found for the bispectrum in [23] and successfully applied to CMB data
in [11]. Our heuristic calculations in this Section suggest very clearly that the
needlets bispectrum may enjoy the same good power properties, at the same
time healing the difficulties that were met in [11] in the presence of missing
observations. These claims will be soon scrutinized on simulations and on real
data.
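The qualitative advantage of squeezed configurations can be illustrated with a small numerical sketch. It assumes the Sachs-Wolfe form $b_{l_1 l_2 l_3} = -6 f_{NL} (C_{l_1} C_{l_2} + C_{l_1} C_{l_3} + C_{l_2} C_{l_3})$ that is standard in the cosmological literature, a pure power-law spectrum $C_l = l^{-\alpha}$, and a crude signal measure obtained by dividing by $\sqrt{C_{l_1} C_{l_2} C_{l_3}}$. The power law, the normalisation, the multipoles 2 and 100, and all function names are assumptions made here for illustration; they are not the statistics studied in the paper.

```python
import math


def power_spectrum(l, alpha=2.0):
    """Toy power-law angular power spectrum C_l = l^(-alpha)."""
    return float(l) ** (-alpha)


def sachs_wolfe_reduced_bispectrum(l1, l2, l3, f_nl=1.0, alpha=2.0):
    """Reduced bispectrum -6 f_NL (C1 C2 + C1 C3 + C2 C3) under the toy spectrum."""
    c1, c2, c3 = (power_spectrum(l, alpha) for l in (l1, l2, l3))
    return -6.0 * f_nl * (c1 * c2 + c1 * c3 + c2 * c3)


def normalised_signal(l1, l2, l3, f_nl=1.0, alpha=2.0):
    """Reduced bispectrum divided by sqrt(C1 C2 C3), a rough signal-to-noise proxy."""
    c1, c2, c3 = (power_spectrum(l, alpha) for l in (l1, l2, l3))
    return sachs_wolfe_reduced_bispectrum(l1, l2, l3, f_nl, alpha) / math.sqrt(c1 * c2 * c3)


equilateral = normalised_signal(100, 100, 100)
squeezed = normalised_signal(2, 100, 100)
print(equilateral, squeezed)
```

With these illustrative choices the squeezed configuration carries a much larger normalised signal in absolute value than the equilateral one, in line with the heuristic conclusion above.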